Are Scale Free Networks Better? 
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We study the performance of WeibuU and scale free Internet-like networks and compare them to 
a classical random graph based network. The scaling of the traffic load with the nodal degree is 
established, and confimed in a numerical simulation of the TCP traffic. The scaling allows us to 
estimate the link capacity upgrade required making and extra connection to an existing node. 
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I. INTRODUCTION 

Random graphs were studied since the middle of the 
20th century with the initiative works of mathematicians 
Pal Erdos and Alfred Renyi. In 1960 they published their 
paper On the evolution of random graphs, with the first 
thorough study of the graph theory J,] . However, these 
results were strictly theoretical, since tools to measure 
real-world random graphs at that time were not avail- 
able. With the evolution of the personal computers, we 
have now the possibility to study this topic in practice, 
and, interestingly, the computers themselves provide one 
of the most exciting real random graphs: the computer 
networks. 

As these networks evolved, their properties were ana- 
lyzed, and soon it was clear, that the Erdos-Renyi models 
(ER) were not appropriate for these graphs . The most 
important difference was the scale free property, that the 
computer networks have, but the classical models lack. 
It turned out, that this feature is shared by other types 
of networks, like social 3] and metabolic Q ones. It 
means that the distribution of the degrees of the nodes 
follows a power-law distribution, while the classic graphs 
has a Poisson degree distribution, with an exponential 
tail. It shows that, in the real-world graphs, nodes with 
a high number of connections are much more likely than 
expected. Therefore new models were necessary to de- 
scribe these kind of random graphs. 

One of these models was developed by Laszlo Barabasi 
and Reka Albert (BA model) Their method has 

two key features: incremental growth and preferential at- 
tachment. Incremental growth means that the graph is 
constructed by adding nodes to the existing graph, and 
connecting them according to a construction rule, con- 
trary to the original Erdos-Renyi picture with a static 
graph. Preferential attachment means the the likeli- 
hood of a connection depends on the degree of a node. 



again lifting the classical assumption of equal probabili- 
ties. This model has more variations, and they are able to 
describe a wide class of random graphs. There is another 
class of models study such networks without incremental 
growing, purely on their statistical properties 0- 

In this paper, we investigated network properties of 
BA-like models with different parameters and raise the 
question whether non-classical graphs may perform bet- 
ter transmitting data over them. Certain value of pa- 
rameters yield a graph with an exponential tail degree 
distribution, hence allowing us to compare these classi- 
cal type models to the ones with power-law distribution. 
In Section II we analyze the properties of the extended 
BA models, estimating their node distribution. Next, in 
Section III we construct networks based on these mod- 
els, and simulate a network traffic on such a graph in a 
simplified model estimating the traffic load on the nodes. 
The scaling of the load with nodal degree is presented, al- 
lowing to estimate the proper bandwidth allocation when 
upgrading a node. In Section IV we study a more realistic 
setup with TCP dynamics and compare the theoretical 
result of the previous section to the simulated ones, while 
in Section V we discuss the overall performance of the 
different simulations. Finally, we conclude out analysis. 



II. THE MODEL 

In the original BA-model the graph is constructed as 
follows. Starting from a small initial graph, we extend it 
by adding a new node in each step and connecting it to 
TO randomly selected existing nodes. The probability of 
choosing a particular node is proportional to its degree, 
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This model constructs a scale free graph, where the cu- 
mulative degree distribution has a power-law decay with 
an exponent 2 j^. 

In the following, we modify the construction law, and 
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use the more general form 
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weighting the probabihty with a power a of the nodal 
degree, similarly to Ref. [g- The BA model corresponds 
to a=l, while for a = the preferential connectivity is 
cancelled, and we are back to a classical ER-like graph 
model with a uniform distribution, leading to exponential 
node distribution. While this model is similar to the 
original ER model, it differs in some aspects, such as 
it has a minimal guaranteed degree, and the orderingof 
the nodes presents nonzero correlations in the degrees Q ■ 
With a € (0, 1), the models provide a smooth transition 
between the classical and the scale free models. 

The degree distribution for these models can be derived 
following the method introduced in Q. Here we give 
a fast estimate on the distribution, for the exact result 
see 111- First, we estimate the rate of growth of the degree 
at each node, assuming that the growth of the degree is 
continuous in time. At time t there are exactly t nodes 
and mt links between them. Hence, the expected degree 
value ki = E{di) of node i is growing as 
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For a = 0, the denominator is simply conunting the num- 
ber of nodes, and is equal to t, while for a = 1, it is 
(double) counting the number of links, and thus is 2m t, 
both being a linear function of the time t. Numerical 
simulations showed that for a G (0, 1) the denominator 
is well approximated by a linear function ct, where c is 
the a dependent measured slope. Hence, generally our 
differential equation can be written as 



ct 



(4) 



Fortunately, these equations are solvable for each a in 
the choosen (0, 1) range. Specifically, for a = we get 



fc,(t) 



\ogt 



and more generally for a E (0, 1) 

Finally, for the scale free case a 
changes, and we arrive at 
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= 1 the structure 
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In order to get a formula for the degree distribution, note 
that for each i, ki{i) = m, since at the time the node is 
added it has exactly m connections. The growth process 



for node i has a similarity to the growth of the previous 
nodes, and this suggests that 



ki{t) = fci 



(8) 



The cumulative distribution P{ki > x) counts the num- 
ber of nodes with ki{t) > x, i.e. ki (4) > x, which leads 
to 



t 
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Normalizing back with the total number of nodes at time 
t (being also t) we get the approximate cumulative prob- 
ability 



P[h > x) 
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Inverting Eq. © we find 

P{ki > x) ~ e^T^ 



(10) 



(11) 



the WeibuU-distribution for a G (0, 1) with scale param- 
eter c, and shape parameter 1 — a. We note, that the 
exact result is of the form m 



P{ki > x) ~ a; 



(12) 



where ... denotes higher order correction terms. 

For a = (c = 1) one recovers the classical limit 
P{x) ^ e~^. In the region a G (0, 1) the distribu- 
tion still vanishes exponentially, so strictly speaking no 
heavy tails are present, however, the probability of find- 
ing nodes with large degree increases dramatically. Nu- 
merical studies revealed, that the distribution indeed fol- 
lows the form (|ll|l . however, the parameters are slightly 
different from the predicted ones, due to the approxima- 
tion used in In the limit a = 1 (c = 2) we arrive at 
the genuine scale free heavy tail result 



P{x) 



(13) 



We note, that the analysis can be extended to a > 1, 
however, such a distribution actually tends to the degen- 
erate case with one node being connected with all others 
in a star-like topology. For a = 1 the original BA model 
results in c = 2, however, different growth stategies are 
able to vary the value of c, hence the decay exponent. In 
this work we restrict ourselves to the original BA model. 



III. ESTIMATION OF THE NETWORK LOAD 
IN A SIMPLIFIED MODEL 

In the following, first we generate random graphs on a 
computer according to the extended BA model and 
study the load in a simplified network model. We as- 
sume, that each node generates a constant traffic (data 
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FIG. 1: Tail of cumulative distributions of the nodal degree, load and throughput for geometries with a =0 (ER graph, left), 
0.5 (middle) and 1 (BA graph, right). 



stream) to all the other nodes. The amount of traffic is 
the same for all connections, and furthermore, we assume 
that the link capacity can handle the accumulated traf- 
fic. The properties of the network are analysed tuning 

the geometrical parameter a. 

For each value of a we performed measurements on 
8 different graphs (generated with the same statistical 
properties), and then averaged. Each topology was made 
out of N = 100000 nodes and m = 3 links per node. 
The traffic was routed using the standard shortest-path 
strategy, and the number of data streams, passing a node 
was counted for each node. Since by construction at least 
m{N — 1) data channel arc open per node, we define the 
load li, to be the number of data channels divided by A'', 
the number of nodes, at node i. In the large N limit this 
gives the number of links m, if no "foreign" connection 
is going through the selected node. One can regard this 
quantity as "weighted degree" , with weights equal to the 
link loads. 

To study the effect of congestion, we assume, that for 
N nodes a data stream occupies 1 /N part of the link ca- 
pacity. With N{N—1) connections there will be certainly 
"overused" links, where the link bandwidth capacity con- 
strains the througput of the node. Thus we define the 
throughput of a link to be the minimum of the load and 
the link capacity, while the throughput of the node to be 
the sum of the link troughputs over the links connected 
to the given node. 

Figure 1 shows the tails of the cumulative distribu- 
tion functions (1.0 — CDF) for degrees, load and (node) 
throughput on graphs with q:=0, 0.5 and 1. The distribu- 
tion of the degrees show the expected tails, exponential 
{a = 0, ER graph), WcibuU {a = 0.5), and the power-law 
(a = 1, BA graph), respectively. However, the distribu- 
tion of load shows interesting deviations from the one of 
the nodal degree. For a = the weighted nodal degree 
(load) distribution can be approximated much better by 
a Weibull distribution than an exponential one, describ- 
ing the nodal degree distribution. This is a clear indica- 
tion, that the distribution of the network traffic even in 



a simulated "uniform" situaton does not follow one-by- 
one the underlying network topology, rather developes a 
heavier tail. For a = 0.5 the load remains Weibull, how- 
ever, the shape parameter (theoretically 1 — a), changes 
from the numerical value 0.62 to 0.32. In case of a = 1 
the power-law decay survives, but with changing CDF 
tail exponent, dcscrcasing from the numerical value 1.97 
to 1.25. We may conclude, that the load distribution 
has considerably fatter tails than the underlying nodal 
distribution. 

The pcr-node throughput shows a transition between 
the nodal degree and the load distribution. For low 
throughput values it follows the load distribution (the 
bandwidth is enough to hold the traffic), however, at 
higher values it approaches the nodal degree distribution, 
simply counting the number of links. The transition is 
governed by the link capacities, in a real network en- 
vironment with large available capacity one expects the 
througput to follow the load distribution, however, in an 
underdesigned network it follows the degree distribution, 
as we show in the next section. 



The change of the distribution from exponential to 

Weibull, and the parameter changes in the Weibull and 
in the power-law case in the load distribution suggests to 
examine the correlation between the degree and the load. 
This is shown in Figure 2 (loft) on a scattcr-plot. Since 
it is log-log scaled, the linear clusters indicate a power- 
law correlations, Z; = . The dependence of the scaling 
exponent /3, on the network geometry parameter a, is 
shown in Figure 2 (right). The load is pushed to have 
fatter tails than the degree distribution, and the more 
"classical" is the network the larger the deviation. For 
the BA geometry the the load distribution decays ~ 1.6 
times faster than the corresponding degree distribution. 

This means that not only the distribution of the load 
is more heavy-tailed than the distribution of the degrees, 
but also means that this dependence is quite strictly a 
power-law function. It also explains the deviations from 
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FIG. 2; The degree-load joint distribution for different network geometries (left). Scaling exponent P, of the load with the 
network geometry parameter a (right). 



the degree distributions, since a power of an exponen- 
tially distributed (a = 0) random variable is WeibuU, 
while in the case of Weibull or power-law degree dis- 
tributon the transformation results only in a parame- 
ter change for the distribution. It seems that the traf- 
fic pushes the distribution into heavier tails, from the 
exponential distribution, and converging to a power-law 
through WeibuUs. The initial push at a = is extremly 
high, with an exponent 1/2 in the Weibull distribution. 
The load distribution is much less sensitive to the under- 
lying network as the degree distribution. 

Since the nodal throughput is bounded from above by 
the degree, it limits the throughput on the high-degree 
nodes, where the links are already fully utilised. There- 
fore the throughput-degree joint distribution is different 
from the load-degree distribution, the power-law correla- 
tion is only valid for low and medium-degree nodes. But 
in real life the congestion at the overloaded nodes also 
affects the other parts of the network, since every data 
flow through these nodes is jammed. Furthermore, the 
TCP dynamics is also known to be chaotic ^3 which 
may change the scalings observed in a simpler model. To 
simulate the real life situation, and compare them with 
the results of the simplified model, next, we simulate a 
realistic traffic on a computer network, too. 

IV. ESTIMATION OF THE NETWORK LOAD 
IN A SIMULATED TRAFFIC 

In order to compare the theoretical results from the 
previous section to a more realistic setup, we simulated 
a TCP/IP network with N = 1024 nodes, m = 3 links 
per node, and a uniform link bandwidth of 1 Mb/s. Ev- 
ery node communicated with a randomly selected target 
node. To study the effect of congestion we modelled three 
scenarios: a low, a medium, and a high traffic one, us- 
ing constant bitrate data flows of 16 Kb/s, 64 Kb/s, and 



256 Kb/s, respectively. The simulation was ran using 
the Berkeley Network Simulator package The link 

throughput was calculated as the number of packets sent 
through that link, and the nodal values as the sum of the 
throughput of the incident links. 



To our surprise, in each scenario the throughput 
showed a good scaling through the whole degree range. 
Since the throughput at the high-degree nodes is obvi- 
ously limited, it must mean that the congestion limits 
the throughput of the other nodes in such way that the 
power-law throughput-degree correlation remains valid. 
It also means that the congestion affects each other node 
proportionally to its degree. 

The correlation exponent (3 was, however, different in 
the three simulation. In the low-trafflc scenario the mea- 
sured distribution is exactly the same as obtained from 
the numerical simulations of the load in the previous 
section. The congestion still has not set up, and the 
throughput is identical to the load. As the traffic inten- 
sity grows, exponent (3 decreases, and flattens. It shows 
that the heavy traffic is more evenly spreads through the 
network, but its dependence on the degrees remains a 
power-law function. 

V. OVERALL PERFORMANCE 

The next quantity we studied is the total number 
of transferred packets in the simulated network. Each 
node transmits constantly TCP packects to its randomly 
choosen partner, and if a packet arrives, an acknowledge- 
ment is sent back to the originating node. We counted 
the number of packets for which the acknowledgement 
was received and the difference betweeen the number of 
transmitted packets and acknowledged packets is the loss. 
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FIG. 3: Simulated traffic on an a = 1 geometry network. The degree-load joint distribution for different network traffic (left). 
Scaling exponent /3, of the load with the network geometry parameter a for low, medium and high traffic(right). 



The source of the loss of packets in the network is the 
congestion: whether the link capacity cannot handle the 
amount of traffic, or the nodes in between cannot cope 
with the routing of the packages. One would expect, 
that in a network with smaller shortest pathes between 
two randomly choosen nodes the load on the links and 
routers is higher, hence using the same devices it would 
drop the packets more often as a network with larger 
shortest path. 

It is also known, that scale free network have smaller 
shortest pathes connecting two arbitrary nodes |^ ^2 j 
i.e. a scale free network uses less routing devices, how- 
ever, the load on them is higher. Indeed, numerical sim- 
ulation, performed in the previous section also showed, 
that with increasing traffic the performance of the scale 
free (BA) network downgrades, for example, with a drop 
rate of 24% already at medium traffic, while the classical 
(ER) network show a downgrade only of 6% for the same 
traffic. 



scenario 


a = 0.0 


a = 0.5 


Q = 1.0 


low traffic 


0.59 


0.59 


0.57 


medium traffic 


2.21 


2.09 


1.74 


high traffic 


3.42 


3.26 


2.96 



TABLE 1: The performance of the networks in millions of 
packets successfully sent 



ton does not follow one-by-one the underlying network, 
rather it developes fatter tails than the nodal degree dis- 
tribution of the network. A scaling between the nodal 
degree distribution and the load on a node was estab- 
lished, showing a power like patter, I ~ d^^°'\ where 
the scaling exponent (3, is a decreasing function of the 
network parameter, a. For networks with fatter nodal 
distribution the exponent is smaller. As a consequence, 
attaching a new connection to the node requires less up- 
grade in the bandwidth for a scale free network to keep 
the performance, as for a classical (ER) one. 



The above theoretical result was confirmed by simula- 
tion, the per-node throughput is still scaling with nodal 
degree. The scaling dependes on the amount of the traf- 
fic, for a completely congested situation the throughput 
distribution by definition agrees with the nodal degree 
distribution, hence the scaling exponent is 1, however, for 
partly congested or congestion free networks this scaling 
approaches the load distribution with scaling exponents 
in the range 1.4 (partially congested BA network) to 2 
(congestion free ER). A scale free network requires less 
resource upgrade when a new node is added. 



VI. CONCLUSION 

In this paper we showed, that the distribution of the 
network traffic even in a simulated "uniform" situa- 



The overall performance of a scale free network is de- 
creasing rapidly with the traffic, where the classical net- 
work still has almost no loss. It is due to the feature, 
that the ER network uses more routers along the short- 
est connection, and hence the traffic is distributed more 
evenly. 
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